Wavelet Cesptral Coefficients for Isolated Speech Recognition

نویسندگان

  • T. B. Adam
  • M. S. Salam
  • T. S. Gunawan
چکیده

The study proposes an improved feature extraction method that is called Wavelet Cepstral Coefficients (WCC). In traditional cepstral analysis, the cepstrums are calculated with the use of the Discrete Fourier Transform (DFT). Owing to the fact that the DFT calculation assumes signal stationary between frames which in practice is not quite true, the WCC replaces the DFT block in the traditional cepstrum calculation with the Discrete Wavelet Transform (DWT) hence producing the WCC. To evaluate the proposed WCC, speech recognition task of recognizing the 26 English alphabets were conducted. Comparisons with the traditional Mel-Frequency Cepstral Coefficients (MFCC) are done to further analyze the effectiveness of the WCCs. It is found that the WCCs showed some comparable results when compared to the MFCCs considering the WCCs small vector dimension when compared to the MFCCs. The best recognition was found from WCCs at level 5 of the DWT decomposition with a small difference of 1.19% and 3.21% when compared to the MFCCs for speaker independent and speaker dependent tasks respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DWT and LPC based feature extraction methods for isolated word recognition

In this article, new feature extraction methods, which utilize wavelet decomposition and reduced order linear predictive coding (LPC) coefficients, have been proposed for speech recognition. The coefficients have been derived from the speech frames decomposed using discrete wavelet transform. LPC coefficients derived from subband decomposition (abbreviated as WLPC) of speech frame provide bette...

متن کامل

Mel Frequency Discrete Wavelet Coefficients for Kannada Speech Recognition using PCA

In this paper, a new scheme for recognition of isolated words in kannada Language speech, based on the Discrete Wavelet Transform(DWT) and PCA has been proposed. First, the DWT of the speech is computed and then MFCC coefficients are calculated. For this, Principal Component Analysis procedure is applied for speech recognition. This paper also presents the comparative results with respect to th...

متن کامل

New Feature Extraction Techniques for Marathi Digit Recognition

In this paper a new efficient feature extraction methods for speech recognition have been proposed. The features are obtained from Cepstral Mean Normalized reduced order Linear Predictive Coding (LPC) coefficients derived from the speech frames decomposed using Discrete Wavelet Transform (DWT). In the literature it is assumed that the speech frame of size 10 msec to 30 msec is stationary, howev...

متن کامل

Robust Speech Recognition Using Wavelet Coefficient Features

We propose a new vein of feature vectors for robust speech recognition that use denoised wavelet coefficients. Greater robustness to unexpected additive noise or spectrum distortions begins with more robust acoustic features. The use of wavelet coefficients is motivated by human acoustic process modelling and by the ability of wavelet coefficients to capture important time and frequency feature...

متن کامل

Distance Measures for Wavelet Representation of Speech Segments

Dyadic scheme of wavelet signal decomposition leads to a specific division of frequency bands. It is comparable to mel-frequency division and may be used in effective parameterization of speech signal in recognition systems, speech coding or other speech signal based applications. This paper discusses efficiency of different spectral distance measures applied to wavelet-parameterized speech. Th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013